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Abstract — The remarkable promise of multiple-input multiple- 
output (MIMO) wireless channels has motivated an intense 
research activity to characterize the theoretical and practical 
issues associated with the design of transmit (source) and re- 
ceive (destination) processing matrices under different operating 
conditions. This activity was primarily focused on point-to-point 
(single-hop) communications but more recently there has been an 
extensive work on two-hop or multi-hop settings in which single 
or multiple relays are used to deliver the information from the 
source to the destination. The aim of this tutorial is to provide 
an up-to-date overview of the fundamental results and practical 
implementation issues of designing amplify-and-forward MIMO 
relay systems. 

Index Terms — Tutorial, MIMO, optimization, transceiver de- 
sign, amplify-and-forward, non-regenerative relay, power alloca- 
tion, majorization theory, quality-of -service requirements, single- 
hop, two-hop, multi-hop, one-way, two-way, multiple relays, 
perfect channel state information, robust design. 



I. Introduction 

MULTIPLE-input multiple-output (MIMO) relay com- 
munications are viewed as one of the most promising 
techniques to improve the reliability and coverage of wireless 
systems. While the optimization of point-to-point (single-hop) 
MIMO systems has been widely analyzed (an excellent survey 
on this topic can be found in |fT)). the optimization of MIMO 
relay networks has gained much attention only recently. The 
aim of this tutorial is to provide an overview of the results 
obtained in this area. Due to the considerable amount of work 
in this field and the rapidly intensifying efforts at the time 
of writing, our exposition will be necessarily incomplete and 
will reflect the subjective tastes and interests of the authors. To 
compensate for this partiality, a list of references is provided as 
an entree into the extensive literature available on the subject. 

As is well-known, a first operating distinction in relay 
communications is made on the way the received signals are 
processed by the relays. This can be done according to several 
different protocols such as decode-and-forward, amplify-and- 
forward, compressed-and-forward, mixed-forward and so forth 
(see for example (2] and references therein). The simplest 
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one is the amplify-and-forward (AF) protocol in which non- 
regenerative relays are used to linearly process the received 
signals and to re-transmit them toward the destination. Though 
inherently affected by noise propagation effects, the AF pro- 
tocol is nowadays considered as the most promising solution 
for future and/or existing wireless communications since it 
provides a reasonable trade-off between benefits and practical 
implementation costs. Among the different AF relay systems, 
the simple two-hop model (in which the information is passed 
from the source to the destination using one or more parallel 
relays) has been the focus of much ongoing research. For this 
reason, this tutorial is largely dedicated to the analysis of two- 
hop architectures while the multi-hop case is reviewed only 
briefly. In addition, to simplify the exposition, the single relay 
scenario is almost exclusively considered. 

A second operating distinction can be made between full- 
duplex and half-duplex systems depending on whether relays 
can transmit and receive simultaneously or not. This work 
is focused on half-duplex systems since several practical 
constraints such as power consumption, implementation costs 
and spatial efficiencjQ make them more appealing for wireless 
applications. 

This tutorial is organized as follows!. The optimization of 
a one-way two-hop MIMO system is considered in Section 
II - the largest one of this work in view of the considerable 
attention devoted to such systems. The signal model for linear 
architectures is first introduced under the assumptions of a 
frequency-flat propagation channel and a negligible source- 
destination link. Two different optimization problems are 
considered. In particular, the first is focused on the minimiza- 
tion/maximization of a global objective function subject to 
average power constraints at the source and relay nodes while 
the second aims at minimizing the total power consumption 
while satisfying specific quality-of-service requirements. Next, 
our analysis is extended to non-linear architectures as well as 
to frequency selective channels. We also examine the problem 
of acquiring channel state information at all nodes and describe 
the distinctive features of some robust optimization solutions. 

'Full-duplex systems require an opportune spatial separation between 
transmit and receive antennas in order to reduce loop-back interference. 

2 The following notation is used throughout the paper. Boldface upper and 
lower-case letters denote matrices and vectors, respectively, while lower-case 
letters denote scalars. We use A = diag{a n ; n — 1,2, ... , K} to indicate a 
K xK diagonal matrix with entries a n while A = diag{ Ai , A2 , . . . , Ax } 
stands for a block diagonal matrix. The notations A -1 and A 1 / 2 denote the 
inverse and square-root of a matrix A. We use Ik to denote the identity 



matrix of order K while 



indicates the (fc,£)th entry of the enclosed 



matrix. In addition, we use E { ■ } for expectation, the superscript T and H 
respectively for transposition and Hermitian transposition. 
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Fig. 1. Block diagram of a linear one-way two-hop linear MIMO system. 



Section III is devoted to the optimization of a multi-hop 
relay network, while in Section IV a one-way two-hop MIMO 
system with multiple parallel relays is considered. The direct 
link is investigated in Section [V] while the most recently 
advanced solutions for the optimization of a two-way two-hop 
MIMO system are reviewed in Section [VI] Finally, in Section 
VII we summarize some interesting open issues that are likely 
to be the basis for future research in the optimization of relay 
networks. 

II. Optimization of a one-way two-hop MIMO 

SYSTEM 

The one-way class refers to conventional two-hop systems 
in which four phases are needed to exchange information 
between source and destination via the relay link. A first phase 
is required to convey data from source to relay, a second one is 
needed to go from relay to destination, and two other phases 
are then required for reverse link. The block diagram of a 
one-way two-hop MIMO system in which linear processing is 
employed at all nodes is shown in Fig. Q] We start considering 
the case in which the direct link is negligible due to large path 
attenuation^. This assumption will be removed in Section [V] 
For simplicity, we also restrict our attention to a frequency 
flat-fading channel while the extension to frequency-selective 
fading channels will be discussed only later. 

A. Signal model 

The fcth symbol is denoted by Sk and is taken from a 
quadrature amplitude modulation (QAM) constellation with 
an average power normalized to unity for convenience. We 
assume that the source and destination are both equipped 
with Ns antennas while the relay employs Nr antennaQ In 
addition, we denote by K < min (Nr, Ns) the total number 
of transmitted symbols. 

As shown in Fig. Q] the source vector s = [s±, S2, ■ ■ ■ , Sk] T 
is first linearly processed by a matrix U € C sX and then 
transmitted over the source-relay link in the first phase. At the 
relay, the received signal is first processed by F 6 (^n r xn r 
and then forwarded to the destination during the second phase. 
Vector y at the input of the decision device is eventually given 
by 10 

y = GHUs + Gn (1) 



This choice is motivated by the fact that a relay plays a much more 
important role when the direct link is weak than when it is strong. 

4 The results can be easily extended to a more general case in which different 
number of antennas are available at source and destination. 



where G E 
destination, 



is the processing matrix used at the 
H = HrdFHsjj (2) 



is the equivalent channel matrix, Hsr € ^n r xn s an( j 
Hrd € C sX H are the source-relay and relay-destination 
channel matrices, respectively. In addition, n <G (£ NsXl is a 
zero-mean complex Gaussian vector whose covariance matrix 
is pRn with p > accounting for the noise variance over both 
links^l and 
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Henceforth, we denote by 



H 



SR 



A 1/2 -xtH 

Hsr Hsr Hsr 



and 



H 



RD 



= n 



A 1/2 v ff 
Hrd jv H rd V H rd 



(3) 



(4) 



(5) 



the singular value decompositions (SVDs) of Hsr and 
Hrd and assume without loss of generality that the en- 
tries of the diagonal matrices Ah sr and A-h rd are ar- 
ranged in non-increasing order. This amounts to saying that 
^H SR ,k > ^H SR ,k+i and \H RD ,k > Ajy HD ,fc+l f° r k = 
1,2,..., min (Nr, N s ) - 1, where \ HsR ,k and X HRD ,k stand 
for the fcth diagonal element of Ah sr and Ajj Rn , respectively. 
Also, we denote by 



E = E{(y-s) (y-sf} 



(6) 



the mean square error (MSE) matrix. From ([T] using (0 and 
©, it follows that 

E = (GHU - l K ) (GHU - l K ) H + P GR n G H . (7) 



B. Problem formulation 

A popular approach in the design of AF MIMO relay 
systems is to maximize the capacity between source and 
destination (see (3| and j4] and references therein). Although 
the capacity is one of the most important information-theoretic 
measure, there are many other ways of characterizing the 
reliability of transmission schemes. Most of them relies on 
the minimization/maximization of a global objective function 
/ : M, K — > R subject to average power constraints at 
the source and relay nodes. Henceforth, we assume that / 
depends on the single MSEs {[E]/^.; k = 1,2,..., A"} and 
formalize such optimization problems as follows J5] (see 0] 
and references therein for a detailed discussion on the subject 
in single-hop MIMO systems): 



Vi: mm F f[m k<k ;k = l,2,...,K 



(8) 



s.t. tr{UU^} < P s 

tr{F (H SR WJ H H SR H + P I Nr ) F h }<P r 

(9) 

where Ps and Pr denote the power available for transmission 
at the source and relay, respectively. Following [T|, we restrict 

5 The extension to the case in which the noise contribution over each link 
has a different variance is straightforward. 
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our attention only to reasonable /, i.e., functions that are 
increasing in each argument. 

An alternative approach is based on the minimization of 
the total power consumption while meeting specific quality- 
of-service (QoS) requirements on the different data streams. 
In particular, we assume that the QoS constraints are given in 
terms of the MSEs so that the problem can be formalized as 
J6l (see for example [f]~3l for single-hop MIMO systems): 

V 2 : min tr {U H U + F (H SR XJV H H SR H + P I Nr ) F h } 

G.U,F 

(10) 

s.t. [E] M < i lk for k = 1, 2, . . . , K 

where i]k > specifies the QoS requirement for the fcth data 
stream. 

If not otherwise stated, in the following derivations we 
assume a perfect knowledge of Hsr and Hrd at all nodes. 

C. Design of (G, U, F) for problem V\ 

Since / is increasing in each argument, the optimal G in 
dH) must be such that each [E] fc k is minimized for any given 
(U, F) J5|. As is well known, this is achieved by choosing G 
equal to the Wiener filter, i.e., 

G = \J H H H (HUU ff H H + P R n ) ~ 1 . (11) 

Substituting (fTTT i into © yields 



If / is additively Schur-concave then the optimal U and F 
in ([8]l are given by ||5| 



U = V 



A 1/2 



and 



Hsr 



(14) 



(15) 



E = p (U^H^R^HU + pl K ) 



(12) 



It is worth observing that when the optimal G is equal to the 
Wiener filter the signal-to-interference noise ratio (SINR) over 
the fcth stream is related to the corresponding MSE as follows 

SINR fe = —J 1. (13) 



[El 



k.k 



This means that the optimization problem in I© encompasses 
also all design criteria in which the objective function is 
expressed in terms of SINRs JT] . 

As shown in 0, the optimal U and F can be computed 
in closed-form for additively Schur-concave or Schur-convex 
functions. As originally pointed out in \\\, this class of 
functions is of great interest since many different optimization 
criteria driving the design of wireless communication systems 
arise in connection with it. The interested reader is referred to 
(UJ for a more detailed discussion on the subject. 

1 ) Additively Schur-concave functions: A short list of op- 
timization problems in which / is additively Schur-concave is 
given below: 

• the maximization of the mutual information f([E]k,k',k — 
l,2,...,J0 = -£f =1 log[E] fe , fe ; 

• the minimization of the product of the MSEs 

K 



/([E] fe , fe ;fc = l ) 2,...,JC) = nfc=i[E] fe , fe ; 

• the maximization of the sum of the SINRs f([E]kk',k — 

i,2,...,J0 = -l£i(pfc-i); 

• the maximization of the product of the SINRs 
/([E] feife ; k = 1,2,...,K) = - nf=i(p^ - !)■ 

In writing the above list, we have used the fact that when 
the Wiener filter is used at the destination the SINR is related 
to the MSE through ( TBI 



where V// SH , V# H£) and ^ ss are obtained from the K 
columns of V# SR , Vjj bd , and £Ih S r associated to the K 
largest singular values of the corresponding channel matrix. 
In addition, Ay £ C and Ap £ C are diagonal 
matrices with elements given by 



and 



A 



Ak 



Bi 



F.k 



(16) 



(17) 



Ak^H SR ,k + P 

where XH SR ,k denotes the fcth diagonal entry of Ah sr in ©. 

The coefficients Ak and Bk for fc = 1,2, ...,K account 
for the transmission power required by the fcth stream at the 
source and relay, respectively, and are obtained as the solutions 
of the following problem: 



min f(^) 

{A k >0.B k >0} 



(18) 



s.t. 



jt Ak<Ps 

k=l 



and £ B k < Pr 



k=l 



where A = [Ai, A2, . . . , \k] T and Afc is the fcth eigenvalue of 
E. The latter is obtained substituting (O and (O into (fT2l 
and is given by 

Ak^H S n,k + Bk^HRD ,k + P 



A,, 



Pi 



(19) 



{A k X HsR:k + p) {B k \ HRD ,k + p) 

where \H RD ,k denotes the fcth diagonal entry of Ah rd in (0). 

From ([Pil l and (1151 . it follows that the optimal U and 
F match the singular vectors of the corresponding channel 
matrices. In this way, the strongest spatial channels of the 
source-relay and relay-destination links are matched together. 

Collecting the above results together, it is easily seen that 
the overall channel matrix T-L = GH^FHs^U becomes 
diagonal with entries given by 

^U,k^H S R,k^F,k^H RD ,k 



A 



^U,k^H SR ,k^F,k^H RI },k + P {^F,k^H RD ,k + 1) 

Also, the MSE matrix turns out to be diagonal with elements 
given by 

[E] fc , fc = A fc . (20) 

From the above results, it follows that the AF MIMO relay 
system becomes equivalent to a set of parallel single-input 
single-output (SISO) channels. This is depicted in Fig. [2] when 
S is chosen equal to the identity matrix. A similar result was 
obtained for single-hop systems JT] . 

Now the only problem left is to solve (118t . Once {A k } and 
{Bk} are computed, the optimal allocation of the available 
power over the parallel SISO channels of Fig. |2]can be found 
through (fT6l l and d!7t . While in single-hop systems the optimal 
power distribution can easily be found by means of water- 
filling inspired algorithms \ \ \ , solving (fT8l represents the most 
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Fig. 2. Equivalent block diagram of the optimized one-way two-hop MIMO 
system when the direct link is omitted. When / is additively Schur-concave, 
the matrix S must be chosen equal to the identity matrix. On the other hand, 
when / is additively Schur-convex S is unitary and must be designed such 
that (25) is satisfied. 



challenging task since it is not in a convex form. A possible 
solution to this problem is represented by the grid-search 
based algorithm illustrated in Q. The latter was originally 
proposed for the maximization of the mutual information in 
one-way two-hop SISO multicarrier systems and has been 
recently extended to a generic function of the MSEs in J8] 
only for the simple case in which a uniform power allocation 
is adopted at the source. Unfortunately, the algorithm in Q is 
computationally intensive since a high-dense two-dimensional 
grid-search whose complexity grows quadratically with the 
number of subcarriers (spatial channels in the system under 
investigation) is required to obtain good approximations of the 
global minimum. To overcome this problem, the authors in 
propose also an alternative approach based on an heuristic line 
of reasoning in which the power is allocated separately at the 
source and relay by means of a water-filling algorithm operat- 
ing over a progressively searched subset of subcarriers/spatial 
channels. This leads to a suboptimal procedure with linear 
complexity whose solution is shown in Q to be close to the 
optimal one. 

An alternative approach that may guarantee a good trade- 
off between complexity and performance is achieved with the 
method proposed in J3] and (9], where (Q~8) is alternately 
solved with respect to {Ak} or {Bk} keeping the other fixed. 
This leads to an iterative optimization procedure that if prop- 
erly initialized monotonically converges to a local optimum 
of ( TT~8b since the conditional updates of {Ak} and {Bk} may 
either decrease or maintain (but not increase) the objective 
function /(A). Interestingly, the minimization of /(A) in ( fT~8l > 
with respect to {Ak} (or {Bk}) when {Bk} (or {Afc}) is fixed 
leads to a water-filling inspired solution for most of the Schur- 
concave functions of interest 0. However, as with any other 
iterative algorithms some caution must be taken in applying 
the above scheme since a bad initialization can prevent it 
from converging to a local optimum. For example, in Q the 
authors observe that a uniform power allocation may not be 
a good initial point in fully spatial-correlated channels, i.e., 
AflsR.fc = Xh sr and ^H RD ,k = Xh rd for all k = 1, 2, . . . ,K. 

An approximate solution of (TToT l can be obtained in closed- 
form (without the need of any iterative procedure) through the 



low-complexity algorithm proposed in IfTOI . However, such a 
method works properly only when the power budgets Pg and 
are sufficiently higher than the noise variance p. 
2) Additively Schur-convex functions: A short list of opti- 
mization problems in which / is additively Schur-convex is 
given below (see for example |Tj and J5] for more details): 

• the minimization of the sum of the MSEs f([E]fc k = 

i>2,...,tf) = £f=i[E] fc , fc ; 

• the minimization of the maximum MSE f([E]k.k',k = 
1, 2, . . . , K ) = maxi< fe <K [E] fc k ; 

• the minimization of the harmonic mean of the SINRs 

/([E] fc , fc ;fc = l,2 J ... ) X) = Ef =lT J fe; 

• the maximization of the minimum of the SINRs 

f(\E]k,k\ k = l,2,...,K) = - min^kKK ^g^f- 
If / is additively Schur-convex then the optimal matrices U 
and F in (O are given by ||5l 

l/2 a H 



and 



Vff 



\ 1/2 n H 



(21) 



(22) 



where S e C x is unitary while the entries of the diagonal 
matrices Ajj € C KxK and Ap € C KxK are still given by 
( fT6l ) and ( 1171 , The quantities {Ak} and {Bk} are now obtained 
as: 

A' 



mm 

{A k >0,B k >0 



s.t. 



A k \ 



Bk\H RD ,k 



k=l 



A 



{Ak\H S R,k + P) {Bk\H RD .k + P) 



zZA k <Ps 

k=l 



and 



E B k < Pr- 

k=l 



(23) 



The above problem is still not convex and the same techniques 
illustrated previously can be applied to obtain a suboptimal 
solution. Interestingly, it is seen that the power allocation 
problem (l23l does not depend on the particular choice of /. 

As shown in Fig. [2] when / is an additively Schur-convex 
function the optimal structure of the relay system is diagonal 
up to a unitary matrix S. The latter must be chosen such that 
the diagonal elements of the MSE matrix E, now given by, 




E = SAS 



H 



(24) 



are all equal to the arithmetic mean of its eigenvalues, i.e., 

A" 



k,k 



K 



(25) 



where {A;} are still of the form in (|T9b . If if is a power 
of two, the above condition can easily be met choosing S 
equal to the discrete Fourier transform matrix or to a Walsh- 
Hadamard matrix. Otherwise, S can be found through the 
iterative procedure described in (QT|. 

Fig. |3] illustrates the bit-error-rate (BER) of a 4-QAM 
constellation as function of the signal-to-noise ratio (SNR) on 
the source-relay link for different optimization criteria when 
N s = N R = 3, K = 2 and the SNR over the relay-destination 
link is fixed to 20 dB. Comparisons are made among the 
designs based on three Schur-concave functions: the maxi- 
mization of the mutual information (MI), the maximization of 
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Fig. 3. BER of a one-way two-hop MIMO system as a function of the SNR 
over the source-relay link when the destination is equipped with a linear or 
non-linear (DFE) receiver with Ng = Nr = 3 and K = 2. In addition, the 
SNR over the relay-destination link is fixed and equal to 20 dB. 



the product of the SINRs (Prod - SINR) and the minimization 
of the sum of the MSEs (Sum - MSE); and also on a Schur- 
convex function^ the minimization of the maximum MSE 
(Max - MSE). The alternating algorithm developed in |5 | and 
l9l is used to allocate the available power at source and relay. 
It is seen that the Schur-convex design outperforms Schur- 
concave ones while all the investigated solutions perform 
consistently better than the NAF (naive AF) design in which 
U and F are set equal to scaled identity matrices. It is worth 
observing that MI is a good criterion only for coded systems in 
which the number of symbols for each coding block is large. 
On the other hand, the simulation setup of Fig. 3 refers to 
uncoded systems with a small number of symbols (4-QAM 
and K = 2) for each block and comparisons among the 
different schemes are made in terms of raw BER. In these 
circumstances, it is not surprising that the Mi-based algorithm 
does not yield a better performance than other solutions based 
on different criteria. Moreover, the fact that the BER of a 
Schur-convex driven design is smaller than that of a Schur- 
concave one is not surprising and it is in accordance with 
the results illustrated in [1] for single-hop systems. A simple 
explanation relies on the following observation. As shown 
in d25l l, when a Schur-convex design is applied the MSEs 
are all equal to the arithmetic mean of the quantities {A&}. 
On the other hand, from ( |20b it follows that when a Schur- 
concave design is employed the fcth MSE is simply equal to 
Xk- Since the average BER of a MIMO system is dominated 
by the spatial stream with the largest MSE, it follows that a 
Schur-concave design cannot provide better performance than 
a Schur-convex one as long as the quantities {A^} are different. 



"Observe that any other Schur-convex function would provide the same 
performance of the considered one as all of them lead to the same solution. 



D. Design of (G, U, F) for problem P 2 

A close inspection of ( fTOb reveals that the best we can do is 
to choose G so as to minimize each MSE |fl3l . This means that 
the optimal G in ( fTOb is still given by the Wiener filter (fTTT i. 
On the other hand, the optimal U and F have the following 
form il 



U = V 



Hsr^U 



l/2 a H 



and 



V 



(26) 



(27) 



iKxK 



where S 6 C"^" is unitary and such that the diagonal 
elements of E = SAS ff satisfy the following condition 



[El 



k.k 



for k = 1,2. 



. , K. 



(28) 



The entries of the diagonal matrices An and Af are still 
obtained as in ( [Tol l and dT7b with Ak and Bk solutions of the 
following problem (the interested reader is referred to [2] for 
more details): 



mm 

{A k >0},{B k >0} 



K 

E 

fc=i 



K 



k=l 



(29) 



s.t. E^<E% fori = i,2,...,is: 

0~< A fc < A fc+ i < 1 k=l,2,...,K-l. 

As for the minimization of additively Schur-convex functions, 
the optimized structure is diagonal up to a unitary matrix S. 
The latter must be now designed such that (f28b is fulfilled. For 
this purpose, the same iterative procedure mentioned before 
and illustrated in ifTTI can be used. 

Finding the solution to d29b is again the major challenge of 
the optimization. A possible approach is discussed in J6) in 
which the optimal solution is upper- and lower-bounded using 
the geometric programming approach and the dual decompo- 
sition technique, respectively. Unfortunately, the computation 
complexity of both solutions is relatively high so as to make 
them unsuited for practical implementation. A reduced com- 
plexity algorithm is derived in lfT2l in which the optimization 
in d29b is first carried out over Ak and Bk for a fixed A& 
and then over all possible A& within the feasible set of ( [29] ). 
As shown in [12], this approach allows to approximate the 
original problem with a convex one, whose solution can be 
computed in closed-form through a multi-step procedure that 
requires no more than K — 1 steps. 

Fig. 2] illustrates the total power consumption in dB when 
the noise variances over the source -relay and relay-destination 
links are both equal to dB and N$ = Nr = K = 3 while 
the QoS constraints are for simplicity assumed to be identical, 
i.e., rji = ij2 = T]3 = V- The curve labelled with GP refers to a 
system in which the power allocation problem is approximated 
using the geometric programming approach proposed in 
while RC corresponds to the reduced complexity algorithm 
developed in 11211 . Comparisons are made with SA (sub- 
optimal algorithm) in which the unitary matrix S in ( f26b is set 
equal to the identity matrix [6 J. As seen, GP and RC provide 
substantially the same performance and achieve a remarkable 
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Fig. 4. Power consumption in dB when the destination is equipped with a 
linear or non-linear (DFE) receiver and Ng = Nr = K = 3 while equal 
QoS constraints are imposed, i.e., r\\ = r/2 = »?3 = r). In addition, the noise 
variances over the source-relay and relay-destination links are both fixed to 
dB. 



gain with respect to SA. In |fl~Z), it is shown that the amount 
of power required by RC is very close to the minimum. 

Remark. It is worth observing that in practical applica- 
tions source and relay may be unable to meet all the QoS 
requirements due to their limited power resource or due to 
regulations specifying the maximum transmit power. This 
calls for some countermeasures. A possible way out to this 
problem (not investigated yet) is represented by the technique 
illustrated in lfl3l for single-hop MIMO systems in which the 
QoS constraints that produce the largest increase in terms of 
transmit power are first identified and then relaxed using a 
perturbation analysis. An alternative approach is to make use 
of an admission control algorithm such as the one illustrated 
in lfT4l for multi-user single-antenna relay systems in which 
the power minimization problem is carried out jointly with 
the maximization of the number of users that can be QoS- 
guaranteed. 

E. Extension to non-linear architectures 

As is well-known, a decision feedback equalizer (DFE) 
provides a significant performance gain with respect to a linear 
one in single-hop MIMO systems either in terms of BER or 
system capacity. Similar results have recently been found also 
in one-way two-hop MIMO systems (see for example J6|, 
|[T2l and lfl5l ) using the same mathematical tools employed 
in single-hop systems. These are largely represented by the 
multiplicative majorization theory and the equal-diagonal QR 
decomposition method illustrated in JT6). As an alternative to 
DFE, non-linear prefiltering based on Tomlinson-Harashima 
precoding (THP) has also gained some attention lfl7l - fl9l . 

Due to space limitations, we limit our attention to a DFE- 
based architecture for which (under the assumption of correct 
previous decisions) the vector y at the input of the decision 



device is given by y = (GHU — B) s + Gn where B G 
C KxK is strictly upper triangular and known as backward 
matrix JT5]. 

1) Design of (G,U,F,B) for problem V\: In (B), it is 
demonstrated that if the objective function / is multiplicatively 
Schur-concave0 the optimized non-linear architecture reduces 
to the linear one discussed previously. This means that there 
is no advantage in using a DFE at the destination when 
multiplicatively Schur-concave functions are considered. On 
the other hand, if / is multiplicatively Schur-convex a different 
result is found in IT31 . In particular, it is shown that the 
optimal G is the Wiener filter and the optimal B is equal 
to B = DL H — Ik where L G C KxK is lower triangular and 
such that 

LiL H = U ff H ff R n 1 HU + pI K 

while D G C KxK is diagonal and designed so as to scale to 
unity the entries [DL^]^ for k = 1, 2, . . . , K. The optimal 
U and F take the form [TT5l 



U = Vh^A^S 11 



and 



v ffsB A!/ 2 n fl 



Hsr 



(30) 



(31) 



where S G C KxK is unitary while the entries of the diagonal 
matrices Ajj and Ap are in the same form of (fTol and (fTTI i 
but with {A k } and {B k } obtained solving: 

K 



mm 

{A k >0,B k >0} 



Ak^H SR ,k + Bk\H RD ,k + P 



k=l 



(A k \ HsR , k + p) {B k X HRD ,k + p) 



s.t. 



E A k <P s and £ B k < P R . 

k=l k=l 



(32) 



Now, the unitary matrix S is such that (see 1(151 for more 
details) 

/ K \~ 



k.k 



n 



(33) 



is satisfied with \ k defined as in ( fT9l ). This is achieved through 
the iterative algorithm illustrated in 1120) . In addition, the MSE 
matrix turns out to be given by 

-l 



E 



LL 



= SAS 



(34) 



from which using ( 1331 it follows that its diagonal elements are 
all equal to the geometric mean of its eigenvalues |[T5l 



A" 



[E" 



fe.fe 



(35) 



At this stage, we observe that every increasing additively 
Schur-convex function is multiplicatively Schur-convex as 
well. Consequently, the additively Schur-convex functions 
analyzed in and reported in the previous section can easily 
be accommodated in the above framework (see 0] and ||221 for 



7 Due to space limitations, we do not report a list of multiplicatively Schur- 
concave or Schur-convex functions and limit to observe that they play the 
same role as additively Schur-concave or Schur-convex functions in the linear 
case. The interested reader is invited to refer to JTJ for more details. 
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further details on this subject). Moreover, from (132) it follows 
that, similar to the linear case, with additively Schur-convex 
functions the power allocation problem with multiplicatively 
Schur-convex functions is independent of / and the optimal 
processing matrices lead to a channel-diagonalizing structure 
provided that the symbols are properly rotated at the source 
and destination by the unitary matrix S. 

Finding the globally optimal solution of ( f32b with algo- 
rithms of affordable complexity is extremely hard since the 
problem is not in a convex form. However, locally optimal 
solutions can be obtained resorting to the same methods 
illustrated previously for the linear case [TT31 . 

In Fig. [3 the curve labelled with "Max - MSE with DFE" 
refers to a system in which the destination is equipped with 
a DFE and the design is made according to a multiplicatively 
Schur-convex function: the minimization of the maximum 
MSE. The available power is allocated using the alternating 
algorithm proposed in J5] and J9). As expected, a non-linear 
system provides better performance than a linear one. This 
advantage is lost if a multiplicatively Schur-concave function 
is chosen since in these circumstances the non-linear system 
reduces to the linear one. 

2) Design of (G,U,P,B) for problem V 2 : When the 
power minimization problem is considered, in J6] it is shown 
that the optimal (G,U, F,B) have the same form as before 
for the case of multiplicatively Schur-convex functions with 
the only differences that S is such that [L]^ = y/rjk for 
k = 1,2,..., A" and the quantities {Ak} and {Bk} are 
solutions of the following problem: 



K K 

mm y^Ak+y^Bk 

{A k > 0},{B k > 0} ^ 



k=l 



k=l 



(36) 



s.t. 



fl A fe < fl Vk for j = 1,2,. 

k=l fc=l 

0<A fe <A fe+ i<l k = l,2, 



,K 



1. 



Using the same arguments adopted for the linear case, in J6] an 
upper- and a lower-bound to the globally optimal solution of 
the above non-convex problem are computed. Alternatively, 
the reduced-complexity procedure developed in ifTZl can be 
used. 

In Fig. H the curves labelled with GP-DFE and RC-DFE 
refer respectively to a system in which the successive GP 
approach of J6] and the algorithm developed in |[T2"1 are 
employed in conjunction with a DFE. As seen, both solutions 
require substantially the same power and largely outperform 
the corresponding ones obtained with a linear receiver for all 
the investigated values of rj. 

F. Extension to frequency selective fading channels 

As done in single-hop MIMO systems, the above opti- 
mization procedures can be extended to frequency-selective 
fading channels using multicarrier transmissions. To see how 
this comes about, assume for example that an orthogonal 
frequency-division multiplexing (OFDM) transmission scheme 
with N subcarriers is used and focus on the problem of 
minimizing a global objective function under fixed power 



constraints. As illustrated in [5] and 0151 . when cooperation 
among subcarriers is allowed, the optimization problem is 
formally equivalent to (H) both for the linear and non-linear 
architecture (clearly, in the latter case the optimization has 
also to be done with respect to the backward matrix B, 
just as discussed in Section II. E). This means that if / is 
additively (multiplicatively) Schur-concave or Schur-convex 
then the optimal processing matrices have the same form as 
before for the linear (non-linear) flat-fading case. A similar 
result holds true for the minimization of the total power 
consumption (problem V2 )■ 

Consider now the less general case in which an independent 
linear processing must be performed at each subcarrier. De- 
noting by G n ,XJ n and F„ the processing matrices operating 
over the nth subcarrier and calling MSE,^ the MSE of the 
kth symbol over the nth subcarrier, the optimization problem 
can be formalized as follows: 



min /(3i,32, ■ • -,9n) 



(37) 



s.t. 3„ = / n (MSE„, 1 ,MSE„ i2 ,...,MSE„ )K ) n = l,2,...,JV 



N 



E tr{U„U^} < P S 

n=l 
N 

E tr {F„ (H SR , n U n U%H SR , n H + pI Nn ) F»} < P R 



where / : R ->• R and /„ : R — > R are generic 
objective functions while H.s R . n € C rX s denotes the 
source-relay channel matrix over the nth subcarrier. Inter- 
estingly, if / and f n are increasing in each argument, the 
above optimization problem can be greatly simplified using the 
primal decomposition technique which allows to decompose 
the given problem into N independent subproblems controlled 
by a master problem. The latter is given by 



min /({Ps,4,{P fi ,n}) 

{Ps,n>0},{P R , n >0} 

N N 

s.t. E Ps,n < Ps and £ P R , n < P R 

n—1 n—1 



(38) 



where Ps. n and P R . n denote the power allocated over the 
nth subcarrier by the source and relay, respectively, while 
f({Ps,n}, {Pr,u}) = /(Si,ff2, ■ • • ,9n) with each g n corre- 
sponding to the minimum value of the cost function f n in the 
following subproblem: 

miii g n = / n (MSE M ,MSE„, 2 , . . . , MSE„^) 

G n , U n , F n 

(39) 

s.t. tr{U n U^}<P s , n 

tr {F„ (Hsfl,„U„U^H sfl , n ff + pI Na ) F^} < P R , n . 



Using the above procedure, the solution of (137t can be effi- 
ciently computed as follows. The master problem in (l3Ft can 
be solved using the same techniques illustrated and analyzed in 
l2T1l while the nth subproblem turns out to have the same form 
of (O once / is replaced with /„. This means that when f n 
is additively Schur-concave or Schur-convexd the solution of 

8 Observe that in the carrier cooperative scheme the global cost function / is 
required to be additively Schur-concave/convex, whereas in the noncooperative 
scheme each f n must be additively Schur-concave/ convex. 
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each subproblem can be computed in closed form as shown 
previously for the linear fiat-fading case. Such a scheme is 
known in the technical literature as carrier-noncooperative ap- 
proach with optimal power allocation 1221 . Alternatively, the 
design of the processing matrices can be performed under the 
assumption that no-cooperation is allowed and a fixed power 
(for example, a uniform distributed power) is allocated to each 
subcarrier. This scheme is simply referred to as a carrier- 
noncooperative approach and it is again formally equivalent 
to ©. As expected, the carrier-cooperative approach performs 
better than the carrier-noncooperative ones especially when 
highly frequency selective channels are considered 0. Indeed, 
in these circumstances the frequency diversity of the channel 
provides additional degrees of freedom that with cooperating 
subcarriers can be exploited to improve the system perfor- 
mance. 

The problem of minimizing the total power consumption in 
OFDM-MIMO systems in which several types of services are 
supported through spatial multiplexing has recently been in- 
vestigated in 11231 . Since in practical applications the reliability 
of each type of transmission depends on a global performance 
metric measured over the assigned subcarriers, differently from 
[6] the power minimization problem is reformulated assuming 
that the QoS constraint of each service is given as a generic 
Schur-convex function of the MSEs over all subcarriers rather 
than as a set of constraints on individual MSEs. Interestingly, 
it turns out that the solution of this problem reduces to the one 
illustrated in J6) for both a linear and a non-linear architecture. 
The only difference with respect to J6] relies on the structure 
of the unitary matrix to apply to the transmitted data symbols 
at the source and destination nodes. 

G. Acquisition of channel state information and robust opti- 
mization 

As seen, the optimization of a one-way two-hop MIMO 
system requires explicit knowledge of the source-relay and 
relay-destination channel matrices. In principle, channel acqui- 
sition at the receiver (relay and destination) can be obtained 
using the same methods employed in conventional single-hop 
MIMO networks (see for example [1241 and references therein). 
Specific algorithms for channel estimation in AF relay systems 
are also available in literature. For example, the estimation of 
both the source-relay and relay-destination channels can be 
performed directly at the destination node through the pilot- 
based schemes recently proposed in [251 an d ll26l . On the 
other hand, channel acquisition at the transmitter (source and 
relay) is a more demanding task. A possible solution relies 
on exploiting the channel reciprocity between the forward 
and reverse links and it is suited for open-loop systems. 
Unfortunately, the reciprocity is only valid for the "over the 
air" (i.e., from antenna to antenna) segment while channel 
estimation is usually performed at the baseband level after 
the radio-frequency chain. This calls for efficient calibration 
schemes not easy to be implemented 11271 . An alternative 
approach consists in performing the estimation process at the 
receiver and feeding channel measurements back through a 
reliable reverse link. This strategy is suited for closed-loop 



systems and it is nowadays considered as the most promising 
solution for commercial applications. 

Motivated by the above discussion, we focus on closed- 
loop techniques in the next. This amounts to saying that the 
estimation of Hsr is performed at the relay whereas the 
task of estimating Hrd is left to the destination node. Once 
estimates of Hsr an d Hrd are available, one has to decide 
how they should be shared among nodes for the computation 
of the optimal processing matrices. A possible solution is to 
make use of a distributee^ algorithm in which each node 
computes its own processing matrix. This means that the 
estimates of Hsr and Hrd must be sent from the relay to the 
destination and from the destination to the relay, respectively, 
whereas they must be both transmitted to the source node via 
a feedback channel from the relay. An alternative approach is 
represented by a centralized algorithm in which only a single 
node computes all the optimal processing matrices and then 
transmits them to the others. Clearly, the centralized strategy 
requires the computation node to acquire information about 
all the propagation channels either by direct estimation or via 
feedback links. 

Remark: As mentioned in the Introduction, this work is 
mainly focused on transceiver design under the assumption 
that source and relay have perfect or at least partial knowledge 
of the propagation channel. As is well-known, when this 
assumption does not hold true and no channel state information 
is available during transmission, one may resort to space-time 
coding techniques as a means to significantly improve the link 
reliability and spectral efficiency of wireless communication 
systems. Since the pioneering work of Tarokh et al. in 1281 . 
space-time coding has gained a lot of interest both from 
academia and industry and a large number of publications 
has flourished in the literature in a few years (see [|29l and 
references therein). Most of the research activity has been 
primarily focused on single-hop MIMO systems but it has been 
recently extended to relay networks with single or multiple 
antennas under the name of cooperative diversity or user 
diversity (see for example 11301 ). Herein, the multiple terminals 
(source and/or relays) cooperate to create a virtual antenna 
array that provides some form of spatial diversity. Due to 
space limitations, we cannot provide a detailed description of 
the possible architectures but we refer the interested reader 
to BTI - ll40l (and references therein) for a comprehensive 
overview of the literature available on this subject. 

1) Robust Linear Optimization: Assume that estimates 
Hsr and Hbd of the source-relay and relay-destination chan- 
nels are available. The simplest solution is to use them in place 
of Hsr and Hrd for the computation of the optimal process- 
ing matrices. Although simple, such an approach leads to a 
substantial performance degradation with respect to the perfect 
knowledge case. An alternative route consists in designing 
the optimal matrices taking the channel estimation errors into 
account. This leads to the so-called robust optimization design 
in which either min-max approaches or stochastic methods 
are usually employed BTI . In the sequel, we concentrate on 

'The word "distributed" has several meanings in wireless communications 
but herein it is used to refer to a system in which each node makes only use 
of the channel state information available locally. 
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stochastic methods and review some recent works developed 
specifically for a linear AF architecture. 

The robust design of (G, F) that minimizes the sum of the 
MSEs with a power constraint at the relay has been addressed 
in l42l . The main difference with respect to the perfect channel 
knowledge case is that all the statistical expectations involved 
in the optimization problem are computed also with respect to 
the distribution of the channel estimation errors. To this end, 
in 11421 the following error models are assumed (see also ll43l 
and B4l and references therein): 



H 



SR 



and 



where 



and 



H 



RD — 



H 



RD 



SR 



*RD 



^SR 



SR^SR* SR 



1/2 



(40) 



(41) 



(42) 



y.1/2 A (* 

^ nnii 1 



1/2 



'RD^RD ^ RD 

are the channel estimation error matrices 



elements of A 



SR 



e 



C Nr 



xN 



(w) 
RD 



(43) 

In particular, the 

G C NsxNli ) are 



independent and identically distributed zero-mean complex 
Gaussian random variables with unit variance, while Hsr € 

C N R xN R e C N 3 xN 3 j and yT R € c NsxN s £ 

C JVhxJVr ) are the row and column co variance matrices of 
As/j (Afljj), respectively. Assuming that the covariance 
matrices are perfectly known and that tlsR and 'H.rd are 
given, the channel uncertainties in (l40l > and fiTt are only 
represented by and A^> in d42l and d43l . In these 

circumstances, it is easily recognized that the a posteriori 
distributions of the matrices Hsr and Hud in (l40l and (|4"TT i 
follow the well-known Gaussian-Kronecker model l42l . 

It is worth observing that the expressions of EgR, *sii, 
and VI/aD depend on the specific channel estimation 
algorithm. For example, it can easily be shown 11421 Remark 
1] that the above model includes two well-known Bayesian 
MMSE estimators proposed in literature 11431 and lH4l . 

The robust design of (G, U, F) has been addressed in 
P51 for a flat-fading scenario and in ||46) for an OFDM 
system operating over a frequency-selective fading channel. 
The extension of these results to additively Schur-concave and 
Schur-convex functions can be found in [47 1 and is now briefly 
reviewed. 

We start observing that the optimization problem in ((8), 
when all nodes have imperfect channel state information, takes 
the following form l47l : 



Vi : 



mm 

G,U,F 



E 



fe.fe 



1,2 Jf 



s.t. tr|UU H }<P s 
tr |f (llsR 



(44) 



SR 



pi 



N, 



where a = tv{XJU h *&sr} and E = E Hsk .h hd {E} denotes 
the expectation of the MSE matrix in (0 computed with 
respect to the statistical distributions of Hsr and Hbd in 
d40b and (UQ. This yields 



E = GAG ff GHU U"H U G 



where we have defined H = H^dFHsr and 



a = f3n RD F ( h sr vu h h% r 



with 



SR, 



Pi 



Nr 



¥ H H H RD V 



RD" 



/3 = tr{F(H si? UU ff Hf fl + aV S R + P I Nr )F h * rd }. 

The optimal G in d44t is the Wiener filter given by G = 
U ff H fl A _1 . On the other hand, finding the optimal U and 
F for arbitrary covariance matrices is very difficult. In WT\ 
their explicit structure is provided only when the row and/or 
the column covariance matrices are equal to scaled identity 
matrices. For example, assume that ^sr — 6 srJ-n r an d 
S_r_d = crd^Ns and let the SVDs of TIsr and W. R d t> e 
respectively given by 



H 



SR 



^H A T V £ 

fss Hsr h sr 



and 



H 



RD 



n 6 \f 

HrD Har 



V 



In the above circumstances, if additively Schur-concave func- 
tions are considered the optimal U and F take the form 11471 



and 



U 



F = V 



Hr 



Ay 2 n H 



Hsr 



(45) 



(46) 



where tt 



Hsr 
, V 



HSR tiRU 



correspond to the K columns 
and Va associated to the K largest singu- 

Hsr. Hsr. Hr D ° KxK 

lar values. The entries of the diagonal matrices Ajj G C 
and Ap G C KxK can be obtained through the iterative 
method developed in (5). If / is a Schur-convex function, the 
optimal F is as in ( |46l i while the optimal U is given by 



U 



(47) 



with S being a proper unitary matrix chosen such that the 
diagonal elements of E are all equal to the arithmetic mean 
of its eigenvalues. 

As mentioned before, in ll42l and B31 - l47l it is shown that 
a robust design provides better performance compared to the 
simple scheme in which the channel estimation errors are not 
taken into account and the estimated channel matrices Hsr 
r < ^Rand 11 R d are simply used in place of Hsr and H R d. 
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III. Optimization of a one-way multi-hop MIMO 

SYSTEM 

In case of a long source-destination distance, multi-hop 
communications may be necessary to carry the information 
from the source to the destination. Consider for example 
a linear L— hop MIMO relay system consisting of L — 1 
relays each equipped (for notational simplicity) with the same 
number N$ of antennas employed at source and destination 
nodes and call G C Ns x s the channel matrix between the 
ith and the (i — l)th nodes or hops (see Fig . [5j ■ Also, denote by 
F G C NsxK and F, G C NsxNs for i = 1, 2, . . . , L - 1 the 
source and the ith relay processing matrices, respectively. In 
these circumstances, the vector received at the ith relay takes 
the form 

= HiXj_! + n,, i = l,2,...,L (48) 

where xo = FqS with s e C Kxl and x,; G (£, NsXl for i 
1.2...../. lis the signal vector transmitted by the ith relay 
given by 

x 4 = F,r 4 (49) 
while rii G (j^^xi accoun t s f or thermal noise. 

Some recent works on the multi-hop MIMO systems de- 
scribed by the above model can be found in 11481 - BP . In 
particular, in ||48l the asymptotic capacity is derived under the 
assumption that Fj for i = 1, 2, . . . , L — 1 is a scaled identity 
matrix while the capacity scaling law with an asymptotically 
large number of hops is computed in fl49l . In ll50l the authors 
investigate the achievable diversity gain when diagonal relay- 
ing matrices are used. In lIBTl the optimal {F^} are found by 
neglecting the noise at the relay nodes. 

The first attempt at designing all the involved processing 
matrices according to a different optimization criterion can be 
found in l52l in which the minimization of additively Schur- 
concave/convex objective functions subject to average power 
constraints at the source and the relays is considered. Although 
conceptually similar to the one discussed in Section II, the 
above problem is much more involved since the objective 
function now depends on all the relay amplifying matrices 
and the power constraint at each relay node is a function of 
the processing matrices of all backward nodes. Interestingly, it 
turns out that the solution of such a complicated problem has 
the same form of the channel-diagonalizing structure found 
before for a two-hop system. Furthermore, this elegant result 
is valid for any arbitrary number L of hops. Mathematically, 
G is the Wiener filter while the optimal F and Fj for 
i = 1, 2, . . . , L — 1 are given by |52l 

F = V Hl \y 2 S H (50) 

and 

F< = Vfl-^A^nf, (51) 

where S G <C KxK is unitary while fljj i G Q NsxK and 
Vjt 4 G C Ns xK are obtained from the SVD of H;. In addition, 
Au G C KxK and {A^ G C KxK } are the diagonal power 
loading matrices that can be designed for example using the 
alternating power loading algorithm described in Section II. 
As for two-hop systems, S is equal to the identity matrix 




Fig. 6. Block diagram of one-way two-hop linear MIMO systems with 
multiple relays. 

when additively Schur-concave functions are considered while 
it must be such that the overall MSE matrix has identical 
diagonal elements for additively Schur-convex functions. All 
the above results have been later extended to non-linear 
architectures in lfl5l while the minimization of the power 
consumption with QoS requirements is discussed in J6). 

Although interesting from a mathematical point of view, 
the realization of the above optimized multi-hop system is 
a challenging task since centralized processing is required 
to compute the optimal Au and {A^}. This may lead 
to a system with high computational complexity and large 
signaling overhead. To overcome this difficulty, simplified 
algorithms are proposed in ll53l in which the optimization of 
the relay matrices is carried out locally at each relay node 
while maintaining comparable performance with respect to 
the optimal one. This is enabled by the observation that the 
optimal processing matrix of each relay can be rewritten as 
the combination of two linear filters that allow to decompose 
the overall MSE matrix E into the sum of the MSE matrices 
at all relays. 

When the instantaneous channel state information is only 
available at the destination but it is unknown at the source 
and the relays, the structure of the optimal source and relay 
amplifying matrices that maximize the source-destination er- 
godic sum capacity is derived in 0541 using knowledge of the 
channel covariance matrices. 

Remark. When the source is far away from the destination 
and a large number of hops is required, the noise propagation 
effect of non-regenerative relays makes data recovery at the 
destination almost impossible for practical values of SNRs. 
In these circumstances, a combination of regenerative and 
non-regenerative relays should be used to provide a good 
tradeoff between the end-to-end processing delay and error 
rate performance ll52l . 

IV. Optimization of a one-way two-hop MIMO 

SYSTEM WITH MULTIPLE (PARALLEL) RELAYS 

The block diagram of a linear one-way two-hop MIMO 
system with multiple (parallel) relays is shown in Fig. [6] 
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in which the number of active relays is denoted by Q and 
the matrix collecting the channel gains between the source 
and the qth relay (the qth relay and the destination) for 

q = 1,2,..., Q is called H sfl „ G C ffj,XJTs (H* n e 



-i if xl 



In these circumstances, the vector y G 
can be expressed as in ([D with the only difference that 



the matrices U SR G C NrQ 
now take the form Hsr = 

H.RD = \H-Rj_D Hr 2 D ' ' • H 



xJVs 



and Hrd G 



H ffi 2 



^NsxNrQ 

and 



iiQ-Dj 



while F G C NrQxNrQ 



block diagonal and given by F = diag { F i , F2 , . . . , Fq } . 

In 1551 the authors set G and U equal to the identity matrix 
and attempt to find the optimal structure of F minimizing 
the sum of the MSEs subject to a global power constraint 
at the output of the relays. Unfortunately, the solution is 
found in closed-form only for the simple case in which relays 
are equipped with a single antenna, i.e., Nr = 1 while 
the multiple antenna case is addressed without imposing any 
power constraint at the relays. The design of F that minimizes 
the total power consumption while fulfilling a given set of SNR 
constraints is investigated in (56) and a power efficient solution 
is derived in closed-form after solving a two-step optimization 
problem. The extension to a system in which U and G may 
have a general structure has recently been considered in [|57l 
for the minimization of the sum of the MSEs. In particular, in 
ll57l it is found that the optimal G is the Wiener filter while 
the optimal U and F are such that 



U = V 



A 1/2 



and 



H RD F = PAp /2 f2g 



Hsr/ 



(52) 



(53) 



iKxK 



is an arbitrary unitary 



ary 

•>Ns 



xK 



In the above equations, P G C 
matrix while &Hsr G <C® NrXK and Vh sr G 
obtained from the SVD of Hsr- From the above results, it 
follows that the diagonalization of the overall channel matrix 
H = H#dFH,s,rU is achieved up to a unitary matrix P. 
Unfortunately, in 157) such a matrix is arbitrarily set equal to 
the identity since finding its optimal structure is an extremely 
hard problem whose solution is not known yet. On the other 
hand, the evaluation of the elements of the diagonal matrices 
A;/ and Ap requires to solve a non-convex power allocation 
problem which can be closely approximated using the same 
arguments outlined in Section II. 

Fig. Q illustrates the BER of the system designed in ||57) 
as a function of the SNR over the source-relay links when 
the symbols belong to a 4— QAM constellation and Q varies 
from 2 to 5. In addition, the SNRs over the relay-destination 
links are fixed to 20 dB. As expected, increasing the number 
of relays improves the system performance. Numerical results 
not shown for space limitations show that a substantial im- 
provement is achieved with respect to ll55l in which only the 
processing matrix F is optimized. 

The above results have recently been extended in 11581 
to a system in which a DFE is used at the destination. In 
particular, it is found that the channel-diagonalizing structure 
is optimal provided that two unitary matrices are used at the 
source (destination) to rotate (counter-rotate) the transmitted 




Fig. 7. BER of a one-way two-hop MIMO system as a function of the SNR 
over the source-relay links when Q = 2, 3 and 5 multiple relays are used 
and Ng = Nr = K = 3. In addition, the SNRs over the relay-destination 
links are fixed to 20 dB. 



(received) symbols. As before, finding the optimal structure of 
the above unitary matrices is an open issue. 

The optimization of (G, F) for minimizing the power con- 
sumption subject to MSE-based QoS constraints is investigated 
in ll59l with either an optimal or non-optimal source precoding 
matrix U. 

V. Optimization of a one-way two-hop MIMO 

SYSTEM IN THE PRESENCE OF DIRECT LINK 

When the direct link is sufficiently strong, the exchange of 
information between source and destination still requires four 
phases as in the case in which the direct link is negligible. 
However, differently from this latter case both destination 
and relay receive the source signal in the first phase. On the 
other hand, during the second phase the source node is silent 
and the relay node sends the linearly precoded signal to the 
destination. In these circumstances, the vector y turns out to 
be in the same form of (fl~|i with the equivalent channel matrix 
H = HrbFH.sr replaced by H (DL ) = [Hf D , H H ] H ", 

iNsXNs 



where H SD G C" b ^" s is the source-destination channel 
matrix. In addition, the covariance matrix of the noise vector 
n is found to be pH[^ L ^ with 



IjV S OjVg 

0jv s Rn 

and R n given by 

Although there exists much ongoing research, the problem 
of jointly designing (G, U, F) when the direct link is present 
is still much open. A first attempt can be found in J5) in which 
the maximization of the mutual information is considered and 
the design of F is carried out under the assumption that no 
operation is performed at the source and destination. This 
amounts to setting G and U equal to the identity matrix. 
Unfortunately, finding the solution of the above problem turns 
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out to be extremely challenging and only upper- and lower- 
bounds of the mutual information are used to compute F. 
The same problem is considered in ||4] wherein a suboptimal 
structure of F is derived following a different line which does 
not take the power constraint at the relay into account. In ll60l . 
U is chosen equal to the identity matrix and the joint design 
of (G,F) is based on the following optimization problem 

£in /([E] fcifc ;fc = l,2,...,A') 

(54) 

s.t. tr {F {HsbHsr + pI NR ) F H } < P R 

where / is a generic increasing function of its arguments. As 
shown in ||60l , the optimal G is again the Wiener filter while 
the relay processing matrix F is found solving d54l i with E 
now given by 

E = p (Hf D H S D + H ff R- x H + pl K y l . 

From the right-hand-side of the above equation, it is seen that 
differently from (fT~2T > an additional term H^ d Hsd accounting 
for the direct link is present in the MSE matrix. The optimal 
F in d54l i has the following form (60] 

f = y Hrd aci» sr 

where A € C KxK is an arbitrary matrix. The above result 
indicates that F is a general linear beamforming matrix 
matched to the left (right) singular vectors of the source-relay 
(relay-destination) channel. The only difference with respect to 
the case in which the direct link is neglected is that A is not 
diagonal unlike Ap in ( fT5t . This means that the optimized 
structure does not lead to a diagonalization of the overall 
communication system. In l60l a closed form solution for the 
optimal A is provided only for the simple case in which a 
single antenna is employed at the source. On the other hand, 
when multiple antennas are employed A is optimized only by 
means of numerical methods. 

In |6"T1 the authors deal with the problem of designing 
(G,U,F) so as to minimize the sum of the MSEs subject 
to power constraints at the source and relay. As expected, the 
optimal G is found to be the Wiener filter while the joint 
design of U and F is addressed by means of a suboptimal 
two-stage procedure not based on any optimality criterion. The 
same problem is addressed in ifTUl where it is shown that the 
optimal F has the following structure: 

F = V HnD An H 

where A G C KxK is an arbitrary matrix while O 6 C N " xK is 
obtained from the SVD of Hg^U = CIAV H and corresponds 
to the K columns of f2 associated to the K largest singular 
values. As seen, F is given as a function of U and A. 
Unfortunately, in ifTol no closed-form is provided for the 
joint design of U and A. Only a suboptimal solution making 
use of gradient-based numerical methods is discussed. Fig. 
|8] illustrates the BER of the algorithm developed in ifTOl as 
a function of the SNR over the source-relay link when a 
4— QAM constellation is used and the SNRs over the relay- 
destination and source-destination links are respectively given 
by 20 dB and —10 dB. Comparisons are made with respect to 
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Fig. 8. BER of a one-way two-hop MIMO system as a function of the SNR 
over the source-relay link in the presence of direct link when K = N$ = 2 
and Nr = 6. In addition, the SNRs over the relay-destination and source- 
destination links are fixed to 20 dB and —10 dB, respectively. 

the NAF algorithm in which no operation is performed at the 
source and relay and with the solution illustrated in \5\ and 
discussed in Section II in which the direct link is neglected. 
As seen, some improvement is achieved taking into account 
the direct link. 

Non-linear architectures in the presence of the direct link 
have recently been investigated in l62l and l63l . In particular, 
in ||62| a DFE-based system is considered. As for the linear 
case, it turns out that the joint optimization of the processing 
matrices is hard to address. To overcome this difficulty, the 
authors propose a suboptimal solution in which the precoding 
matrix U is constrained to have a unitary structure. On the 
other hand, in l63l THP is employed at the source and only 
two suboptimal approaches are proposed. The first operates in 
an iterative manner and relies on the THP design proposed in 
l64l and on the relay precoder scheme derived in 11611 . The 
second one provides a closed-form solution for the structure 
of U and F but it is again based on the assumption that U is 
unitary. 

VI. Optimization of a two-way two-hop MIMO 

SYSTEM 

As seen, a one-way two-hop system requires four-phases 
to exchange the information between source and destination. 
Since orthogonal channels (in time- or frequency-domain) 
are used to implement each phase, such a system spends 
twice as much channel resources with respect to a direct 
communication. To reduce this penalty, the two-way protocol 
originally proposed in ll65l (see also ll66l and references 
therein) allows source (user 1) and destination (user 2) to 
simultaneously transmit during the first phase. The received 
information at the relays is then forwarded to user 1 and 2 
in the second phase. Since both user 1 and user 2 know their 
own transmitted data, they can remove the self-interference 
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Fig. 9. Block diagram of two-way two-hop linear MIMO systems. 

from the received signal provided that the required channel 
state information is available. This leads to a scheme with the 
same spectral efficiency of direct communications and at the 
same time able to take advantage of the potential benefits of 
relay communications. 

The block diagram of a two-way two-hop MIMO relay 
network is shown in Fig. [9] in which s, for i = 1, 2 is used to 
denote the symbol vector transmitted by the ith user. In these 
circumstances, the signal received at the relay during the first 
phase can be expressed as 

y R = Hi^Usi + H 2 i?Gs 2 + 

where Hm € C^" x Ns for i = 1,2 denotes the channel 
matrix from the ith user to the relay while n R accounts for 
thermal noise with zero mean and covariance matrix given by 
pJ-N R - At the relay, the vector y^ is first processed by a matrix 
F € C NrxNr anc j then forwarded to both users during the 
second phase. The received vector y$ for i = 1, 2 takes the 
form 

yi = H R1 FH 2R Gs 2 + H m FH 17? U Sl + H R1 Fn R + m 

useful signal self-interference noise contribution 

(55) 

and 

y2 = HffiFHiflUsi + H fi2 FH 2J? Gs 2 + H^Fn^ + n 2 

useful signal self-interference noise contribution 

(56) 

where Hjjj € C NsX R for i = 1,2 denotes the channel 
matrix from the relay to the ith user and for i = 1, 2 
is Gaussian with zero mean and covariance matrix pljv s . 
As explained earlier, from (IB31 it is seen that if user 1 has 
knowledge of H^iFHi/? it may remove its self-interference 
term from the received signal. The same can be done by user 
2 if H/? 2 FH 2 /f is known. Then, the vectors at the input of 
the decision devices are given by z\ = H' 1 Gs 2 + and 
z 2 = H 2 Usi + n 2 where we have defined = H#iFH 2 /j 
and H' 2 = H^ 2 FHifl while and n' 2 are respectively given 
by n[ = H R1 Fn R + rii and n 2 = H R2 Fn R + n 2 . 

Some recent works on two-way two-hop MIMO relay 
networks can be found in ll67l and l68l (see also [[691 and 
references therein). In particular, in ll67l the authors con- 
centrate either on the maximization of the achievable sum 
rate or on the minimization of the sum of the MSEs while 
imposing power constraints at the terminals and relay. In both 
cases, they develop an iterative algorithm based on the gradient 
descendent technique that allows to numerically compute an 
approximation of the optimal (G, U, F) (even for the case in 
which multiple relays are used). On the other hand, in ll68l 
the authors deal only with the maximization of the achievable 



sum rate and demonstrate that when N R > 2Ns the optimal 
F takes the form 

F = QiAQf (57) 

where A 6 C 2iVsx2JVs j s an arbitrary matrix while 
(Qi,Q2) € (£<N R x2Ns are sem i-unitary matrices obtained 
from the following two QR decompositions: 

[ H fi Hjf 2 ] = QiRi 

and 

[Hijj H 2R ] = Q2R2 

with (Ri,R 2 ) G C 2 s " 2 s being upper triangular. On 
the basis of the above result, it follows that the original 
problem reduces to jointly design (G, U, A). Clearly, this 
does not change the nature of the problem but it has the only 
practical relevance to lead to a simplification of the problem 
in those applications for which N R > 2N$- Unfortunately, in 
l68ll it is shown that the optimal structure of the processing 
matrices (G, U, A) is hard to find in closed-form since the 
optimization problem is not convex. This is in sharp contrast 
to the one-way relay systems discussed in Section II in which 
the optimal structure of (G, U, F) are found in closed-form 
and leads to the diagonalization of the entire relay system. 
To overcome this difficulty, a couple of numerical methods 
based on iterative procedures in which A and (G, U) are 
alternatively optimized are proposed in ll68l . In particular, 
the optimization of A for a given set of matrices (G, U) 
is performed by means of two different algorithms. The first 
one is a hybrid algorithm in which the gradient descendent 
search and the Newton's method are adaptively combined 
while the second one is inspired by the weighted minimum 
MSE algorithm originally proposed in 1701 . On the other hand, 
the optimization of (G, U) for a given A requires to solve 
a convex problem whose solution is found by means of the 
generalized waterfilling algorithm developed in ||7T| . 

From the above discussion, it follows that channel knowl- 
edge in two-way relay systems plays a key role since it is 
not only necessary for the design of the processing matrices 
but also for self-cancellation purposes. Although the problem 
of channel estimation in one-way relay systems has gained 
some interest, little work has been done for the two-way 
relay protocol. The problem has been recently investigated in 
f72l in which a two-phase training-based algorithm is derived 
according to two different criteria. The first one relies on 
the maximum-likelihood methodology while the second one 
is derived so as to maximize the SNR at the receiver after 
taking the channel estimation errors into account. A possible 
drawback of these schemes is that they do not perform channel 
estimation at the relay in which only a scaling operation 
is performed. This means that the required channel state 
information can be provided at the relay only via a feedback 
channel. A different approach is illustrated in lf73l in which 
channel estimation is performed at the relay during the first 
phase and it is then used in the second one to properly allocate 
the power so as to improve the channel estimate quality at 
the terminal nodes. An alternative approach using the parallel 
factor analysis is proposed in J74). 
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VII. Extensions and future lines of research 

This tutorial has discussed the optimization of AF MIMO 
relay systems. A number of key architectures has been re- 
viewed and investigated under different design criteria and 
operating conditions. As seen, the optimization of a one- 
way two-hop MIMO system is a well-understood issue when 
perfect knowledge of the channel matrices is assumed. Much 
work has still to be done for the development of robust 
solutions able to cope with channel uncertanties. On the 
other hand, different aspects in the optimization of other 
architectures remain unsolved and/or to be further investigated. 
An interesting problem common to all the investigated systems 
is the deployment of decentralized algorithms that may provide 
a good tradeoff between performance and system scalability. 
In addition to all this, there are still many extensions and 
future lines of research related to the optimization of MIMO 
relay networks, some of which are briefly described in the 
next. For example, the optimization of multiuser MIMO relay 
systems has only recently become an active research topic. In 
particular, the multiaccess MIMO relay networks are addressed 
in l53l - ||54l while the multiuser broadcasting relay systems 
are investigated in ITD . Compared to its single-user counter- 
part, the optimization of a multiuser system is much more 
challenging due to the considerable amount of required chan- 
nel state information and computational complexity. Another 
interesting direction is the interference MIMO relay network 
where multiple source nodes communicate with their desired 
destination nodes with the aid of (distributed) relay nodes lfl4l 
and 1751 . Due to the existence of interference, a cross layer 
design between the MAC layer and the PHY layer may be 
employed to optimize the overall system performance. Also, 
the optimization of full-duplex relay networks in which loop- 
back interference is traded for higher spectral efficiency is 
worthy further investigation ll76l . 
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